Ensembling Classifiers - An application to image data classification from Cherenkov telescope experiment
نویسندگان
چکیده
Bagging have been in active research and shown improvements in classification results for several benchmarking data sets with mainly decision trees as their base classifiers. In this paper we experiment to apply these Meta learning techniques with classifiers such as random forests, neural networks and support vector machines. The data sets are from MAGIC, a Cherenkov telescope experiment. The task is to classify gamma signals from overwhelmingly hadron and muon signals representing a rare class classification problem. We compare the individual classifiers with their ensemble counterparts and discuss the results. WEKA a wonderful tool for machine learning has been used for making the experiments. I. PROBLEM DOMAIN AGIC [1] is the Cherenkov telescope used to detect the gamma rays from the outer universe. It is designed to provide vital information on several established gamma-ray sources, like Active Galactic Nuclei, Supernova Remnants, Gamma Ray Bursts and Pulsars. It collects gamma ray events (in little quantities) along with many other particle events represented as images shown in the (Fig.1). The pixels making up the image can be converted to some set of image parameters also called as hillas parameters by various image processing and feature extraction techniques [2], which statistically allow a separation of events. A gamma ray signal defines an ellipse in the camera plane of the telescope where as the other showers make up an error ellipse plane (Fig.1). The data sets used in the experiment contain 10 image parameters. Due to atmospheric radiations, the ground based telescope collects overwhelming events of hadrons and muons also called as background. To understand the gamma ray sources, it is an important task for separating gammas from other particles. There is only a weak discrimination between the gamma and background events, making the data an excellent proving ground for the classification techniques [2]. Things are further complicated by added noise in the data collection by the hardware bias in the telescope. As the sources of high-energy gammas are few and their signal comparatively weak, creating a case for the rare class classification problem. It will be a daunting task to separate the gamma signals from these overwhelming sea of background signals, both having very similar characteristics.. Classification of signals plays a vital role for making the astronomical analysis of gamma ray objects, and any small improvements in the classification accuracy will be significant in these analysis tasks. Automated classification of vector data into …
منابع مشابه
Selectively Ensembling Neural Classifiers
Ensembling neural classifiers can significantly improve the generalization ability of classification systems. In this paper, GASEN, a genetic algorithm based selective ensemble method that has been shown to be excellent in ensembling neural regressors, is applied to neural classifiers. Experiments on four large data sets show that this method can generate ensembles of neural classifiers with st...
متن کاملHow to focus a Cherenkov telescope
Cherenkov telescopes image the Cherenkov emission from air showers. A priori, it is not obvious if the ‘best’ images are achieved by measuring Cherenkov photon angles, i.e. focusing the telescope at infinity, or by considering the air shower as an object to be imaged, in which case one might focus the telescope on the central region of the shower. The issue is addressed using shower simulations...
متن کاملInsights into Machine Learning: Data Clustering and Classification Algorithms for Astrophysical Experiments
Data analysis domain dealing with data exploration, clustering and classification is an important problem in many experiments of astrophysics, computer vision, bioinformatics etc. The field of machine learning is increasingly becoming popular for performing these tasks. In this thesis we deal with machine learning models based on unsupervised and supervised learning algorithms. In unsupervised ...
متن کاملMicrosoft Word - Finding More Non-supersingular Elliptic Curves for Pairing..
Ensemble learning algorithms such as AdaBoost and Bagging have been in active research and shown improvements in classification results for several benchmarking data sets with mainly decision trees as their base classifiers. In this paper we experiment to apply these Meta learning techniques with classifiers such as random forests, neural networks and support vector machines. The data sets are ...
متن کاملSUBCLASS FUZZY-SVM CLASSIFIER AS AN EFFICIENT METHOD TO ENHANCE THE MASS DETECTION IN MAMMOGRAMS
This paper is concerned with the development of a novel classifier for automatic mass detection of mammograms, based on contourlet feature extraction in conjunction with statistical and fuzzy classifiers. In this method, mammograms are segmented into regions of interest (ROI) in order to extract features including geometrical and contourlet coefficients. The extracted features benefit from...
متن کامل